Development of Acoustic Model for Croatian Language Using HTK
نویسندگان
چکیده
Paper presents development of the acoustic model for Croatian language for automatic speech recognition (ASR). Continuous speech recognition is performed by means of the Hidden Markov Models (HMM) implemented in the HMM Toolkit (HTK). In order to adjust the HTK to the native language a novel algorithm for Croatian language transcription (CLT) has been developed. It is based on phonetic assimilation rules that are applied within uttered words. Phonetic questions for state tying of different triphone models have also been developed. The automated system for training and evaluation of acoustic models has been developed and integrated with the new graphical user interface (GUI). Targeted applications of this ASR system are stress inoculation training (SIT) and virtual reality exposure therapy (VRET). Adaptability of the model to a closed set of speakers is important for such applications and this paper investigates the applicability of the HTK tool for typical scenarios. Robustness of the tool to a new language was tested in matched conditions by a parallel training of an English model that was used as a baseline. Ten native Croatian speakers participated in experiments. Encouraging results were achieved and reported with the developed model for Croatian language.
منابع مشابه
Croatian Large Vocabulary Automatic Speech Recognition
This paper presents procedures used for development of a Croatian large vocabulary automatic speech recognition system (LVASR). The proposed acoustic model is based on context-dependent triphone hidden Markov models and Croatian phonetic rules. Different acoustic and language models, developed using a large collection of Croatian speech, are discussed and compared. The paper proposes the best f...
متن کاملThe 1998 HTK Broadcast News Transcription System: Development and Results
This paper presents the development of the HTK broadcast news transcription system for the November 1998 Hub4 evaluation. Relative to the previous year’s system The system a number of features were added including vocal tract length normalisation; cluster-based variance normalisation; double the quantity of acoustic training data; interpolated word level language models to combine text sources;...
متن کاملDear reader ,
You have at your desk the issue no. 1/2010 of the journal AUTOMATIKA with which begins my direct responsibility for its future development in the capacity of the Editor-in-chief. I would like to thank the KOREMA presidency for entrusting me with this responsibility. Special thanks to my predecessor Prof. Borivoj Rajković for the enormous effort he invested in regular publishing of the journal a...
متن کاملLecture 8: Speech Recognition Using Finite State Transducers
In order to use HTK-trained speech recognition models with the AT&T speech recognition search engine, three types of conversion are necessary. First, you must convert the HTK-format hidden Markov models into ATT format acoustic models. Second, you’ll need to write finite state transducers for the language model, dictionary, and context dependency transducer. Third, acoustic feature files need t...
متن کاملHindi Speech Recognition System Using Htk
Speech recognition is the process of converting an acoustic waveform into the text similar to the information being conveyed by the speaker. In the present era, mainly Hidden Markov Model (HMMs) based speech recognizers are used. This paper aims to build a speech recognition system for Hindi language. Hidden Markov Model Toolkit (HTK) is used to develop the system. It recognizes the isolated wo...
متن کامل